Tags → #neural networks
-
Understanding AdaNorm
Understanding adaptive layer normalization (adaLN), as popularized by the DiT paper.
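As a teaser for the post, a minimal numpy sketch of the idea: normalize the activations, then modulate them with a per-sample scale and shift regressed from a conditioning vector. The weight names (`W_scale`, `W_shift`) are hypothetical, not from any particular implementation.

```python
import numpy as np

def adaptive_layer_norm(x, cond, W_scale, W_shift, eps=1e-5):
    # Standard layer norm over the feature axis (no learned gamma/beta here).
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    x_norm = (x - mean) / np.sqrt(var + eps)
    # Scale and shift are produced from the conditioning vector,
    # so the normalization adapts per sample (hypothetical weight names).
    scale = cond @ W_scale
    shift = cond @ W_shift
    return x_norm * (1 + scale) + shift
```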
-
Understanding Squared attention
A brief explanation of how the attention mechanism works, and why its cost scales quadratically with sequence length.
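A quick sketch of where the quadratic cost comes from: scaled dot-product attention forms an n-by-n score matrix, so both memory and compute grow as O(n²) in sequence length. This is a minimal single-head version, not any specific library's implementation.

```python
import numpy as np

def attention(Q, K, V):
    d = Q.shape[-1]
    # The score matrix is (n, n): every query attends to every key,
    # which is the source of the quadratic scaling.
    scores = Q @ K.T / np.sqrt(d)
    # Numerically stable softmax over each row.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V
```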
-
Decoder Transformer
How I understand the decoder-only Transformer used in generative text models.
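The defining piece of a decoder-style Transformer is causal masking: position i may only attend to positions up to i, which is what lets the model generate text left to right. A minimal sketch under that assumption (single head, no learned projections):

```python
import numpy as np

def causal_mask(n):
    # Lower-triangular boolean mask: True where attention is allowed.
    return np.tril(np.ones((n, n), dtype=bool))

def masked_attention(Q, K, V):
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)
    # Disallowed (future) positions get -inf, so softmax zeroes them out.
    scores = np.where(causal_mask(len(Q)), scores, -np.inf)
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    return w @ V
```

Note that the first position can only attend to itself, so its output is exactly its own value vector.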